Online Learning under Delayed Feedback

Authors

  • Pooria Joulani
  • András György
  • Csaba Szepesvári
Abstract

Online learning with delayed feedback has received increasing attention recently due to its several applications in distributed, web-based learning problems. In this paper we provide a systematic study of the topic, and analyze the effect of delay on the regret of online learning algorithms. Somewhat surprisingly, it turns out that delay increases the regret in a multiplicative way in adversarial problems, and in an additive way in stochastic problems. We give meta-algorithms that transform, in a black-box fashion, algorithms developed for the non-delayed case into ones that can handle the presence of delays in the feedback loop. Modifications of the well-known UCB algorithm are also developed for the bandit problem with delayed feedback, with the advantage over the meta-algorithms that they can be implemented with lower complexity.
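The abstract does not spell out how the meta-algorithms work. As a rough illustration of the black-box idea in the adversarial full-information case, the sketch below keeps a pool of copies of a base learner and lets a copy predict again only after the feedback for its previous prediction has arrived. The names `DelayedWrapper`, `Hedge`, and `run`, the exponential-weights base learner, the fixed loss sequence, and the constant delay are all assumptions made for illustration, not details taken from the paper.

```python
from collections import deque
import math


class Hedge:
    """Exponential-weights learner over a few experts (stand-in base algorithm)."""

    def __init__(self, n_experts, eta=0.5):
        self.eta = eta
        self.log_w = [0.0] * n_experts

    def predict(self):
        # Normalise in a numerically stable way and return a distribution.
        m = max(self.log_w)
        w = [math.exp(v - m) for v in self.log_w]
        s = sum(w)
        return [x / s for x in w]

    def update(self, losses):
        self.log_w = [v - self.eta * l for v, l in zip(self.log_w, losses)]


class DelayedWrapper:
    """Hypothetical wrapper: each copy of the base learner sees the feedback
    for its last prediction before it is asked to predict again."""

    def __init__(self, make_base):
        self.make_base = make_base
        self.idle = deque()   # copies that are ready to predict
        self.waiting = {}     # round index -> copy awaiting that round's feedback
        self.n_instances = 0

    def predict(self, t):
        if self.idle:
            base = self.idle.popleft()
        else:
            base = self.make_base()  # no free copy: start a fresh one
            self.n_instances += 1
        self.waiting[t] = base
        return base.predict()

    def feedback(self, t, losses):
        base = self.waiting.pop(t)
        base.update(losses)
        self.idle.append(base)


def run(T, delay, n_experts=3):
    """Simulate T rounds in which the feedback of round t arrives `delay`
    rounds late (delay=0 means it arrives before round t + 1)."""
    wrapper = DelayedWrapper(lambda: Hedge(n_experts))
    pending = deque()  # (arrival_round, original_round, losses)
    for t in range(T):
        while pending and pending[0][0] <= t:
            _, s, losses = pending.popleft()
            wrapper.feedback(s, losses)
        wrapper.predict(t)
        losses = [0.0 if i == 0 else 1.0 for i in range(n_experts)]  # toy losses
        pending.append((t + delay + 1, t, losses))
    return wrapper
```

In this toy setup a constant delay of d rounds leaves the wrapper with d + 1 copies, each of which sees roughly T / (d + 1) rounds. That gives some intuition for why delay could enter the adversarial regret multiplicatively, as the abstract states, though the sketch itself proves nothing about regret.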

Similar resources

Online Learning with Adversarial Delays

We study the performance of standard online learning algorithms when the feedback is delayed by an adversary. We show that online-gradient-descent [1] and follow-the-perturbed-leader [2] achieve regret O(√D) in the delayed setting, where D is the sum of delays of each round’s feedback. This bound collapses to an optimal O(√T) bound in the usual setting of no delays (where D = T). Our main...

The Effect of Knowledge of Result Feedback Timing on Speech Motor Learning in Healthy Adults

Objectives: The current study mainly aimed at studying the effect of Knowledge of Result (KR) feedback timing and result-estimation opportunity before receiving delayed KR on learning a new speech motor skill in monolingual healthy adults. Methods: Thirty-nine Persian healthy adults were randomly divided into three groups. Each group received immediate KR, delayed KR (after eight seconds), or...

Asynchronous Online Discussion Forum: A Key to Enhancing Students’ Writing Ability and Attitudes in Iran

This paper focuses on the impact of an asynchronous online discussion forum on the development of students’ ability in and attitudes toward writing in English. Two groups of third-year students (N = 60) majoring in English were assigned to two treatment and control groups, each receiving different types of feedback. Students in the treatment group were required to participate ...

Effects of Receiving Corrective Feedback through Online Chats and Class Discussions on Iranian EFL Learners' Writing Quality

Giving corrective feedback (CF) is an essential part of the teaching and learning process, and the way it should beneficially be done has been the focus of attention for numerous researchers especially when traditional ways of CF provision are not possible, particularly in rare situations such as outbreaks of diseases. This study investigated how different ways of giving feedback; namely, throu...

Keeping Things that Matter: An Exploration on Delayed Feedback Online Learning

Online learning algorithms operate on a single instance at a time, allowing for updates that are fast, simple and perform well in a wide range of practical settings [1]. In this paper, we focus on online learning algorithms under delayed feedback, where the true labels arrive minutes, hours or even days later. Current solutions to this problem assume that all the instances are kept until the co...

Publication date: 2013